Dataset statistics
| Number of variables | 22 |
|---|---|
| Number of observations | 200000 |
| Missing cells | 766339 |
| Missing cells (%) | 17.4% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 33.6 MiB |
| Average record size in memory | 176.0 B |
Variable types
| Text | 2 |
|---|---|
| DateTime | 2 |
| Numeric | 15 |
| Categorical | 3 |
dropoff_census_tract is highly overall correlated with dropoff_community_area and 3 other fields | High correlation |
dropoff_community_area is highly overall correlated with dropoff_census_tract and 3 other fields | High correlation |
dropoff_latitude is highly overall correlated with dropoff_census_tract and 4 other fields | High correlation |
dropoff_location is highly overall correlated with dropoff_census_tract and 3 other fields | High correlation |
dropoff_longitude is highly overall correlated with dropoff_census_tract and 3 other fields | High correlation |
extras is highly overall correlated with trip_total | High correlation |
fare is highly overall correlated with trip_miles and 2 other fields | High correlation |
payment_type is highly overall correlated with tips | High correlation |
pickup_census_tract is highly overall correlated with pickup_community_area | High correlation |
pickup_community_area is highly overall correlated with pickup_census_tract | High correlation |
pickup_latitude is highly overall correlated with dropoff_latitude and 1 other fields | High correlation |
pickup_longitude is highly overall correlated with pickup_latitude | High correlation |
tips is highly overall correlated with payment_type | High correlation |
trip_miles is highly overall correlated with fare and 2 other fields | High correlation |
trip_seconds is highly overall correlated with fare and 2 other fields | High correlation |
trip_total is highly overall correlated with extras and 3 other fields | High correlation |
payment_type is highly imbalanced (62.8%) | Imbalance |
pickup_census_tract has 111053 (55.5%) missing values | Missing |
dropoff_census_tract has 122048 (61.0%) missing values | Missing |
pickup_community_area has 29017 (14.5%) missing values | Missing |
dropoff_community_area has 73327 (36.7%) missing values | Missing |
tolls has 30137 (15.1%) missing values | Missing |
company has 93668 (46.8%) missing values | Missing |
pickup_latitude has 29002 (14.5%) missing values | Missing |
pickup_longitude has 29002 (14.5%) missing values | Missing |
pickup_location has 29002 (14.5%) missing values | Missing |
dropoff_latitude has 73327 (36.7%) missing values | Missing |
dropoff_longitude has 73327 (36.7%) missing values | Missing |
dropoff_location has 73327 (36.7%) missing values | Missing |
trip_seconds is highly skewed (γ1 = 21.75519908) | Skewed |
fare is highly skewed (γ1 = 148.025731) | Skewed |
tolls is highly skewed (γ1 = 125.2660433) | Skewed |
trip_total is highly skewed (γ1 = 124.9842732) | Skewed |
unique_key has unique values | Unique |
trip_seconds has 3410 (1.7%) zeros | Zeros |
trip_miles has 12673 (6.3%) zeros | Zeros |
tips has 137559 (68.8%) zeros | Zeros |
tolls has 169784 (84.9%) zeros | Zeros |
extras has 116988 (58.5%) zeros | Zeros |
Reproduction
| Analysis started | 2024-02-25 08:07:07.974798 |
|---|---|
| Analysis finished | 2024-02-25 08:15:28.652520 |
| Duration | 8 minutes and 20.68 seconds |
| Software version | ydata-profiling vv4.6.4 |
| Download configuration | config.json |
unique_key
Text
UNIQUE 
| Distinct | 200000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 40 |
| Min length | 40 |
Characters and Unicode
| Total characters | 8000000 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 200000 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | e2afbb4f62fb3865c4d928a39e8a4d1e711ea8da |
|---|---|
| 2nd row | a663e8249660b7a3ae13c9a39378ff495564e8e6 |
| 3rd row | a99a7309aea3cf70eed1644c254eb30b150ac4f2 |
| 4th row | ad0d9e702d67b0e9e7b85dd0750605ff06389c4f |
| 5th row | 350b48036f17d07f79abf53d04b811da8b6c264c |
| Value | Count | Frequency (%) |
| e2afbb4f62fb3865c4d928a39e8a4d1e711ea8da | 1 | < 0.1% |
| 1cd673ed462eb277d26daef0f64acb71b5fa4ab7 | 1 | < 0.1% |
| b7481a3988dd29243a429937ea9184c4252cd23a | 1 | < 0.1% |
| af14f627846d939d75957e76c4fb6cc6f94592e9 | 1 | < 0.1% |
| a99a7309aea3cf70eed1644c254eb30b150ac4f2 | 1 | < 0.1% |
| ad0d9e702d67b0e9e7b85dd0750605ff06389c4f | 1 | < 0.1% |
| 350b48036f17d07f79abf53d04b811da8b6c264c | 1 | < 0.1% |
| b5d0ec1472abae045f794a581f42364213ef79ab | 1 | < 0.1% |
| ac7ce885bf43a27e0e22de0c1e8efd98ebdd1571 | 1 | < 0.1% |
| 68088a5a378c45781ece9cdb1eebdc2afc4047d3 | 1 | < 0.1% |
| Other values (199990) | 199990 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 501071 | 6.3% |
| 4 | 500914 | 6.3% |
| a | 500776 | 6.3% |
| 2 | 500765 | 6.3% |
| f | 500435 | 6.3% |
| 5 | 500110 | 6.3% |
| 3 | 500003 | 6.3% |
| e | 499895 | 6.2% |
| 9 | 499801 | 6.2% |
| c | 499765 | 6.2% |
| Other values (6) | 2996465 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5000241 | |
| Lowercase Letter | 2999759 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 501071 | |
| 4 | 500914 | |
| 2 | 500765 | |
| 5 | 500110 | |
| 3 | 500003 | |
| 9 | 499801 | |
| 6 | 499612 | |
| 0 | 499603 | |
| 1 | 499443 | |
| 7 | 498919 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 500776 | |
| f | 500435 | |
| e | 499895 | |
| c | 499765 | |
| d | 499488 | |
| b | 499400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5000241 | |
| Latin | 2999759 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 501071 | |
| 4 | 500914 | |
| 2 | 500765 | |
| 5 | 500110 | |
| 3 | 500003 | |
| 9 | 499801 | |
| 6 | 499612 | |
| 0 | 499603 | |
| 1 | 499443 | |
| 7 | 498919 |
Latin
| Value | Count | Frequency (%) |
| a | 500776 | |
| f | 500435 | |
| e | 499895 | |
| c | 499765 | |
| d | 499488 | |
| b | 499400 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8000000 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 501071 | 6.3% |
| 4 | 500914 | 6.3% |
| a | 500776 | 6.3% |
| 2 | 500765 | 6.3% |
| f | 500435 | 6.3% |
| 5 | 500110 | 6.3% |
| 3 | 500003 | 6.3% |
| e | 499895 | 6.2% |
| 9 | 499801 | 6.2% |
| c | 499765 | 6.2% |
| Other values (6) | 2996465 |
| Distinct | 31737 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Minimum | 2014-04-01 00:00:00+00:00 |
|---|---|
| Maximum | 2018-07-23 15:00:00+00:00 |
| Distinct | 31646 |
|---|---|
| Distinct (%) | 15.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Minimum | 2014-04-01 00:30:00+00:00 |
|---|---|
| Maximum | 2018-07-23 15:00:00+00:00 |
trip_seconds
Real number (ℝ)
HIGH CORRELATION  SKEWED  ZEROS 
| Distinct | 4564 |
|---|---|
| Distinct (%) | 2.3% |
| Missing | 43 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1017.3965 |
| Minimum | 0 |
|---|---|
| Maximum | 86340 |
| Zeros | 3410 |
| Zeros (%) | 1.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 420 |
| median | 720 |
| Q3 | 1200 |
| 95-th percentile | 2473 |
| Maximum | 86340 |
| Range | 86340 |
| Interquartile range (IQR) | 780 |
Descriptive statistics
| Standard deviation | 2151.8552 |
|---|---|
| Coefficient of variation (CV) | 2.1150607 |
| Kurtosis | 604.92366 |
| Mean | 1017.3965 |
| Median Absolute Deviation (MAD) | 360 |
| Skewness | 21.755199 |
| Sum | 2.0343554 × 108 |
| Variance | 4630481 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 540 | 7604 | 3.8% |
| 480 | 7554 | 3.8% |
| 600 | 7547 | 3.8% |
| 660 | 7171 | 3.6% |
| 420 | 6849 | 3.4% |
| 720 | 6389 | 3.2% |
| 360 | 6142 | 3.1% |
| 780 | 5748 | 2.9% |
| 840 | 5379 | 2.7% |
| 300 | 5083 | 2.5% |
| Other values (4554) | 134491 |
| Value | Count | Frequency (%) |
| 0 | 3410 | |
| 1 | 3160 | |
| 2 | 1377 | |
| 3 | 771 | 0.4% |
| 4 | 577 | 0.3% |
| 5 | 309 | 0.2% |
| 6 | 219 | 0.1% |
| 7 | 219 | 0.1% |
| 8 | 209 | 0.1% |
| 9 | 151 | 0.1% |
| Value | Count | Frequency (%) |
| 86340 | 3 | |
| 85633 | 1 | < 0.1% |
| 85200 | 1 | < 0.1% |
| 84120 | 1 | < 0.1% |
| 83296 | 1 | < 0.1% |
| 82800 | 1 | < 0.1% |
| 81960 | 1 | < 0.1% |
| 80569 | 1 | < 0.1% |
| 80100 | 1 | < 0.1% |
| 79219 | 1 | < 0.1% |
trip_miles
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 3487 |
|---|---|
| Distinct (%) | 1.7% |
| Missing | 11 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.5430392 |
| Minimum | 0 |
|---|---|
| Maximum | 388.1 |
| Zeros | 12673 |
| Zeros (%) | 6.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1.3 |
| median | 2.5 |
| Q3 | 7 |
| 95-th percentile | 19.6 |
| Maximum | 388.1 |
| Range | 388.1 |
| Interquartile range (IQR) | 5.7 |
Descriptive statistics
| Standard deviation | 7.6250346 |
|---|---|
| Coefficient of variation (CV) | 1.3756054 |
| Kurtosis | 146.13869 |
| Mean | 5.5430392 |
| Median Absolute Deviation (MAD) | 1.7 |
| Skewness | 6.1774545 |
| Sum | 1108546.9 |
| Variance | 58.141152 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 12673 | 6.3% |
| 1.7 | 3575 | 1.8% |
| 1.1 | 3458 | 1.7% |
| 1.2 | 3440 | 1.7% |
| 1.3 | 3373 | 1.7% |
| 1.5 | 3328 | 1.7% |
| 1.6 | 3323 | 1.7% |
| 2 | 3279 | 1.6% |
| 1.8 | 3250 | 1.6% |
| 1.4 | 3244 | 1.6% |
| Other values (3477) | 157046 |
| Value | Count | Frequency (%) |
| 0 | 12673 | |
| 0.01 | 88 | < 0.1% |
| 0.02 | 51 | < 0.1% |
| 0.03 | 53 | < 0.1% |
| 0.04 | 64 | < 0.1% |
| 0.05 | 67 | < 0.1% |
| 0.06 | 83 | < 0.1% |
| 0.07 | 117 | 0.1% |
| 0.08 | 95 | < 0.1% |
| 0.09 | 109 | 0.1% |
| Value | Count | Frequency (%) |
| 388.1 | 1 | |
| 361.6 | 1 | |
| 303.1 | 1 | |
| 293.2 | 1 | |
| 287.1 | 1 | |
| 274.5 | 1 | |
| 256.3 | 1 | |
| 250.3 | 1 | |
| 248.8 | 1 | |
| 210.36 | 1 |
pickup_census_tract
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 328 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 111053 |
| Missing (%) | 55.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7031366 × 1010 |
| Minimum | 1.703101 × 1010 |
|---|---|
| Maximum | 1.703198 × 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1.703101 × 1010 |
|---|---|
| 5-th percentile | 1.7031072 × 1010 |
| Q1 | 1.7031081 × 1010 |
| median | 1.7031282 × 1010 |
| Q3 | 1.7031831 × 1010 |
| 95-th percentile | 1.703198 × 1010 |
| Maximum | 1.703198 × 1010 |
| Range | 969898 |
| Interquartile range (IQR) | 749697 |
Descriptive statistics
| Standard deviation | 336604.07 |
|---|---|
| Coefficient of variation (CV) | 1.9763774 × 10-5 |
| Kurtosis | -0.89605857 |
| Mean | 1.7031366 × 1010 |
| Median Absolute Deviation (MAD) | 200497 |
| Skewness | 0.84754245 |
| Sum | 1.5148889 × 1015 |
| Variance | 1.133023 × 1011 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.70313201 × 1010 | 10377 | 5.2% |
| 1.703198 × 1010 | 8874 | 4.4% |
| 1.70318391 × 1010 | 8163 | 4.1% |
| 1.70310815 × 1010 | 7072 | 3.5% |
| 1.70313204 × 1010 | 4870 | 2.4% |
| 1.70312819 × 1010 | 4673 | 2.3% |
| 1.70310814 × 1010 | 4555 | 2.3% |
| 1.70310817 × 1010 | 3862 | 1.9% |
| 1.70313301 × 1010 | 3095 | 1.5% |
| 1.70310814 × 1010 | 2984 | 1.5% |
| Other values (318) | 30422 | 15.2% |
| (Missing) | 111053 |
| Value | Count | Frequency (%) |
| 1.70310102 × 1010 | 1 | < 0.1% |
| 1.70310104 × 1010 | 3 | < 0.1% |
| 1.70310105 × 1010 | 1 | < 0.1% |
| 1.70310105 × 1010 | 1 | < 0.1% |
| 1.70310105 × 1010 | 2 | < 0.1% |
| 1.70310106 × 1010 | 1 | < 0.1% |
| 1.70310201 × 1010 | 1 | < 0.1% |
| 1.70310202 × 1010 | 35 | |
| 1.70310206 × 1010 | 3 | < 0.1% |
| 1.70310208 × 1010 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.70319801 × 1010 | 1903 | 1.0% |
| 1.703198 × 1010 | 8874 | |
| 1.70318437 × 1010 | 5 | < 0.1% |
| 1.70318432 × 1010 | 1 | < 0.1% |
| 1.70318423 × 1010 | 125 | 0.1% |
| 1.70318422 × 1010 | 231 | 0.1% |
| 1.70318419 × 1010 | 176 | 0.1% |
| 1.70318411 × 1010 | 16 | < 0.1% |
| 1.7031841 × 1010 | 585 | 0.3% |
| 1.70318403 × 1010 | 6 | < 0.1% |
dropoff_census_tract
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 222 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 122048 |
| Missing (%) | 61.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7031274 × 1010 |
| Minimum | 1.703101 × 1010 |
|---|---|
| Maximum | 1.703184 × 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1.703101 × 1010 |
|---|---|
| 5-th percentile | 1.7031062 × 1010 |
| Q1 | 1.7031282 × 1010 |
| median | 1.7031282 × 1010 |
| Q3 | 1.703133 × 1010 |
| 95-th percentile | 1.703133 × 1010 |
| Maximum | 1.703184 × 1010 |
| Range | 830000 |
| Interquartile range (IQR) | 48200 |
Descriptive statistics
| Standard deviation | 143493.56 |
|---|---|
| Coefficient of variation (CV) | 8.4252978 × 10-6 |
| Kurtosis | 5.2735237 |
| Mean | 1.7031274 × 1010 |
| Median Absolute Deviation (MAD) | 48200 |
| Skewness | 1.3979693 |
| Sum | 1.3276219 × 1015 |
| Variance | 2.0590401 × 1010 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.70312819 × 1010 | 31775 | 15.9% |
| 1.70313301 × 1010 | 26501 | 13.3% |
| 1.70310714 × 1010 | 7071 | 3.5% |
| 1.70310802 × 1010 | 3493 | 1.7% |
| 1.70310619 × 1010 | 2221 | 1.1% |
| 1.70310819 × 1010 | 1594 | 0.8% |
| 1.70310623 × 1010 | 888 | 0.4% |
| 1.70310502 × 1010 | 607 | 0.3% |
| 1.70318094 × 1010 | 443 | 0.2% |
| 1.70318311 × 1010 | 378 | 0.2% |
| Other values (212) | 2981 | 1.5% |
| (Missing) | 122048 |
| Value | Count | Frequency (%) |
| 1.70310104 × 1010 | 127 | 0.1% |
| 1.70310107 × 1010 | 6 | < 0.1% |
| 1.70310502 × 1010 | 607 | 0.3% |
| 1.70310618 × 1010 | 287 | 0.1% |
| 1.70310619 × 1010 | 2221 | 1.1% |
| 1.70310623 × 1010 | 888 | 0.4% |
| 1.70310714 × 1010 | 7071 | |
| 1.70310802 × 1010 | 3493 | |
| 1.70310819 × 1010 | 1594 | 0.8% |
| 1.70311105 × 1010 | 38 | < 0.1% |
| Value | Count | Frequency (%) |
| 1.70318404 × 1010 | 1 | < 0.1% |
| 1.703184 × 1010 | 3 | < 0.1% |
| 1.70318374 × 1010 | 7 | < 0.1% |
| 1.70318359 × 1010 | 2 | < 0.1% |
| 1.70318314 × 1010 | 1 | < 0.1% |
| 1.70318311 × 1010 | 378 | |
| 1.703183001 × 1010 | 5 | < 0.1% |
| 1.703183001 × 1010 | 2 | < 0.1% |
| 1.703183 × 1010 | 6 | < 0.1% |
| 1.703183 × 1010 | 60 | < 0.1% |
pickup_community_area
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 77 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 29017 |
| Missing (%) | 14.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 27.914453 |
| Minimum | 1 |
|---|---|
| Maximum | 77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 8 |
| median | 24 |
| Q3 | 32 |
| 95-th percentile | 76 |
| Maximum | 77 |
| Range | 76 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 24.662098 |
|---|---|
| Coefficient of variation (CV) | 0.88348847 |
| Kurtosis | -0.33557872 |
| Mean | 27.914453 |
| Median Absolute Deviation (MAD) | 16 |
| Skewness | 0.97935535 |
| Sum | 4772897 |
| Variance | 608.21906 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 47925 | |
| 32 | 32779 | |
| 76 | 25155 | |
| 6 | 11149 | 5.6% |
| 28 | 10553 | 5.3% |
| 7 | 7671 | 3.8% |
| 3 | 5249 | 2.6% |
| 24 | 4923 | 2.5% |
| 56 | 4872 | 2.4% |
| 33 | 4495 | 2.2% |
| Other values (67) | 16212 | 8.1% |
| (Missing) | 29017 |
| Value | Count | Frequency (%) |
| 1 | 1048 | 0.5% |
| 2 | 934 | 0.5% |
| 3 | 5249 | 2.6% |
| 4 | 1135 | 0.6% |
| 5 | 1101 | 0.6% |
| 6 | 11149 | 5.6% |
| 7 | 7671 | 3.8% |
| 8 | 47925 | |
| 9 | 18 | < 0.1% |
| 10 | 113 | 0.1% |
| Value | Count | Frequency (%) |
| 77 | 2564 | 1.3% |
| 76 | 25155 | |
| 75 | 14 | < 0.1% |
| 74 | 3 | < 0.1% |
| 73 | 8 | < 0.1% |
| 72 | 3 | < 0.1% |
| 71 | 8 | < 0.1% |
| 70 | 26 | < 0.1% |
| 69 | 25 | < 0.1% |
| 68 | 10 | < 0.1% |
dropoff_community_area
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73327 |
| Missing (%) | 36.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.381131 |
| Minimum | 1 |
|---|---|
| Maximum | 60 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 28 |
| Q3 | 33 |
| 95-th percentile | 33 |
| Maximum | 60 |
| Range | 59 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 12.42253 |
|---|---|
| Coefficient of variation (CV) | 0.64096002 |
| Kurtosis | -1.6406971 |
| Mean | 19.381131 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.047157273 |
| Sum | 2455066 |
| Variance | 154.31925 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 32667 | |
| 28 | 31775 | |
| 33 | 26501 | 13.3% |
| 3 | 13049 | 6.5% |
| 7 | 7071 | 3.5% |
| 14 | 4907 | 2.5% |
| 41 | 4739 | 2.4% |
| 6 | 3396 | 1.7% |
| 29 | 762 | 0.4% |
| 5 | 607 | 0.3% |
| Other values (17) | 1199 | 0.6% |
| (Missing) | 73327 |
| Value | Count | Frequency (%) |
| 1 | 133 | 0.1% |
| 3 | 13049 | 6.5% |
| 5 | 607 | 0.3% |
| 6 | 3396 | 1.7% |
| 7 | 7071 | 3.5% |
| 8 | 32667 | |
| 11 | 38 | < 0.1% |
| 14 | 4907 | 2.5% |
| 21 | 378 | 0.2% |
| 24 | 117 | 0.1% |
| Value | Count | Frequency (%) |
| 60 | 3 | < 0.1% |
| 59 | 1 | < 0.1% |
| 54 | 7 | < 0.1% |
| 52 | 27 | < 0.1% |
| 47 | 35 | < 0.1% |
| 46 | 155 | 0.1% |
| 42 | 7 | < 0.1% |
| 41 | 4739 | |
| 38 | 2 | < 0.1% |
| 35 | 1 | < 0.1% |
fare
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 1507 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.666398 |
| Minimum | 0 |
|---|---|
| Maximum | 9211.59 |
| Zeros | 182 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6.75 |
| median | 10 |
| Q3 | 18.75 |
| 95-th percentile | 45 |
| Maximum | 9211.59 |
| Range | 9211.59 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 41.823396 |
|---|---|
| Coefficient of variation (CV) | 2.6696242 |
| Kurtosis | 27949.116 |
| Mean | 15.666398 |
| Median Absolute Deviation (MAD) | 4.25 |
| Skewness | 148.02573 |
| Sum | 3133091.7 |
| Variance | 1749.1965 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01 | 7504 | 3.8% |
| 2 | 4439 | 2.2% |
| 3.25 | 3895 | 1.9% |
| 7.25 | 3427 | 1.7% |
| 8.25 | 3406 | 1.7% |
| 9.25 | 3120 | 1.6% |
| 6.25 | 2903 | 1.5% |
| 10.25 | 2692 | 1.3% |
| 8 | 2161 | 1.1% |
| 5.25 | 2147 | 1.1% |
| Other values (1497) | 164294 |
| Value | Count | Frequency (%) |
| 0 | 182 | 0.1% |
| 0.01 | 7504 | |
| 0.07 | 1 | < 0.1% |
| 0.08 | 1 | < 0.1% |
| 0.1 | 2 | < 0.1% |
| 0.2 | 2 | < 0.1% |
| 0.27 | 1 | < 0.1% |
| 0.32 | 2 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.51 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9211.59 | 1 | |
| 9000.9 | 1 | |
| 6300.35 | 1 | |
| 5130.63 | 1 | |
| 5004.74 | 1 | |
| 4002.78 | 1 | |
| 4001.15 | 1 | |
| 2800.17 | 1 | |
| 1081.4 | 1 | |
| 996.63 | 1 |
tips
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 2018 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.4819713 |
| Minimum | 0 |
|---|---|
| Maximum | 199 |
| Zeros | 137559 |
| Zeros (%) | 68.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2 |
| 95-th percentile | 8.05 |
| Maximum | 199 |
| Range | 199 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.4666989 |
|---|---|
| Coefficient of variation (CV) | 2.3392483 |
| Kurtosis | 169.1816 |
| Mean | 1.4819713 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.9602832 |
| Sum | 296376.48 |
| Variance | 12.018001 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 137559 | |
| 2 | 17981 | 9.0% |
| 3 | 7900 | 4.0% |
| 1 | 3639 | 1.8% |
| 5 | 2050 | 1.0% |
| 4 | 1766 | 0.9% |
| 10 | 992 | 0.5% |
| 1.5 | 663 | 0.3% |
| 6 | 542 | 0.3% |
| 7 | 526 | 0.3% |
| Other values (2008) | 26370 | 13.2% |
| Value | Count | Frequency (%) |
| 0 | 137559 | |
| 0.01 | 24 | < 0.1% |
| 0.02 | 13 | < 0.1% |
| 0.03 | 5 | < 0.1% |
| 0.04 | 1 | < 0.1% |
| 0.05 | 2 | < 0.1% |
| 0.07 | 7 | < 0.1% |
| 0.1 | 27 | < 0.1% |
| 0.11 | 2 | < 0.1% |
| 0.15 | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| 199 | 1 | |
| 180 | 1 | |
| 155 | 1 | |
| 145 | 1 | |
| 126.05 | 1 | |
| 105 | 1 | |
| 100 | 1 | |
| 96 | 1 | |
| 75 | 2 | |
| 67 | 1 |
tolls
Real number (ℝ)
MISSING  SKEWED  ZEROS 
| Distinct | 23 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 30137 |
| Missing (%) | 15.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0049993819 |
| Minimum | 0 |
|---|---|
| Maximum | 75 |
| Zeros | 169784 |
| Zeros (%) | 84.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 75 |
| Range | 75 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.48559431 |
|---|---|
| Coefficient of variation (CV) | 97.13087 |
| Kurtosis | 16583.991 |
| Mean | 0.0049993819 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 125.26604 |
| Sum | 849.21 |
| Variance | 0.23580183 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 169784 | |
| 1.9 | 21 | < 0.1% |
| 1.5 | 10 | < 0.1% |
| 3 | 7 | < 0.1% |
| 50 | 7 | < 0.1% |
| 2 | 6 | < 0.1% |
| 4 | 4 | < 0.1% |
| 75 | 3 | < 0.1% |
| 2.1 | 3 | < 0.1% |
| 3.8 | 2 | < 0.1% |
| Other values (13) | 16 | < 0.1% |
| (Missing) | 30137 | 15.1% |
| Value | Count | Frequency (%) |
| 0 | 169784 | |
| 0.9 | 1 | < 0.1% |
| 1.5 | 10 | < 0.1% |
| 1.6 | 1 | < 0.1% |
| 1.9 | 21 | < 0.1% |
| 2 | 6 | < 0.1% |
| 2.1 | 3 | < 0.1% |
| 2.4 | 2 | < 0.1% |
| 2.5 | 1 | < 0.1% |
| 2.7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 75 | 3 | |
| 64.5 | 1 | < 0.1% |
| 50 | 7 | |
| 28.81 | 1 | < 0.1% |
| 12.4 | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| 6 | 2 | < 0.1% |
| 5 | 1 | < 0.1% |
| 4.5 | 2 | < 0.1% |
| 4.2 | 1 | < 0.1% |
extras
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 246 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.1075579 |
| Minimum | 0 |
|---|---|
| Maximum | 99.5 |
| Zeros | 116988 |
| Zeros (%) | 58.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 1 |
| 95-th percentile | 14 |
| Maximum | 99.5 |
| Range | 99.5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 5.9408025 |
|---|---|
| Coefficient of variation (CV) | 2.8188087 |
| Kurtosis | 28.642353 |
| Mean | 2.1075579 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.7540089 |
| Sum | 421486.29 |
| Variance | 35.293135 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 116988 | |
| 1 | 32402 | 16.2% |
| 2 | 11467 | 5.7% |
| 1.5 | 7813 | 3.9% |
| 4 | 6723 | 3.4% |
| 3 | 3010 | 1.5% |
| 5 | 2641 | 1.3% |
| 6 | 1351 | 0.7% |
| 2.5 | 1234 | 0.6% |
| 3.5 | 1128 | 0.6% |
| Other values (236) | 15231 | 7.6% |
| Value | Count | Frequency (%) |
| 0 | 116988 | |
| 0.02 | 1 | < 0.1% |
| 0.04 | 1 | < 0.1% |
| 0.1 | 2 | < 0.1% |
| 0.25 | 1 | < 0.1% |
| 0.5 | 615 | 0.3% |
| 0.75 | 148 | 0.1% |
| 0.9 | 1 | < 0.1% |
| 1 | 32402 | 16.2% |
| 1.5 | 7813 | 3.9% |
| Value | Count | Frequency (%) |
| 99.5 | 6 | |
| 98 | 1 | < 0.1% |
| 96 | 1 | < 0.1% |
| 93.5 | 1 | < 0.1% |
| 90 | 3 | |
| 89 | 1 | < 0.1% |
| 88.5 | 1 | < 0.1% |
| 87 | 2 | < 0.1% |
| 86.5 | 1 | < 0.1% |
| 86 | 2 | < 0.1% |
trip_total
Real number (ℝ)
HIGH CORRELATION  SKEWED 
| Distinct | 5163 |
|---|---|
| Distinct (%) | 2.6% |
| Missing | 12 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 19.34244 |
| Minimum | 0 |
|---|---|
| Maximum | 9299.25 |
| Zeros | 179 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 7.65 |
| median | 11.5 |
| Q3 | 21.05 |
| 95-th percentile | 63 |
| Maximum | 9299.25 |
| Range | 9299.25 |
| Interquartile range (IQR) | 13.4 |
Descriptive statistics
| Standard deviation | 44.482694 |
|---|---|
| Coefficient of variation (CV) | 2.2997458 |
| Kurtosis | 22245.485 |
| Mean | 19.34244 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 124.98427 |
| Sum | 3868255.8 |
| Variance | 1978.7101 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01 | 6643 | 3.3% |
| 2 | 4157 | 2.1% |
| 3.25 | 3073 | 1.5% |
| 9.25 | 2757 | 1.4% |
| 10.25 | 2709 | 1.4% |
| 8.25 | 2652 | 1.3% |
| 7.25 | 2389 | 1.2% |
| 11.25 | 2313 | 1.2% |
| 12.25 | 1993 | 1.0% |
| 6.25 | 1902 | 1.0% |
| Other values (5153) | 169400 |
| Value | Count | Frequency (%) |
| 0 | 179 | 0.1% |
| 0.01 | 6643 | |
| 0.08 | 1 | < 0.1% |
| 0.1 | 2 | < 0.1% |
| 0.2 | 2 | < 0.1% |
| 0.27 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.51 | 2 | < 0.1% |
| 0.52 | 1 | < 0.1% |
| 0.75 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 9299.25 | 1 | |
| 9001 | 1 | |
| 6300.39 | 1 | |
| 5180.63 | 1 | |
| 5057.54 | 1 | |
| 4054.58 | 1 | |
| 4001.15 | 1 | |
| 2850.19 | 1 | |
| 1081.4 | 1 | |
| 996.63 | 1 |
payment_type
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.5 MiB |
| Cash | |
|---|---|
| Credit Card | |
| Prcard | 431 |
| Mobile | 74 |
| Pcard | 65 |
Length
| Max length | 11 |
|---|---|
| Median length | 4 |
| Mean length | 6.4177 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1283540 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Cash |
|---|---|
| 2nd row | Cash |
| 3rd row | Cash |
| 4th row | Cash |
| 5th row | Cash |
Common Values
| Value | Count | Frequency (%) |
| Cash | 130485 | |
| Credit Card | 68920 | |
| Prcard | 431 | 0.2% |
| Mobile | 74 | < 0.1% |
| Pcard | 65 | < 0.1% |
| Split | 25 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cash | 130485 | |
| credit | 68920 | |
| card | 68920 | |
| prcard | 431 | 0.2% |
| mobile | 74 | < 0.1% |
| pcard | 65 | < 0.1% |
| split | 25 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 268325 | |
| a | 199901 | |
| r | 138767 | |
| d | 138336 | |
| s | 130485 | |
| h | 130485 | |
| i | 69019 | 5.4% |
| e | 68994 | 5.4% |
| t | 68945 | 5.4% |
| 68920 | 5.4% | |
| Other values (8) | 1363 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 945700 | |
| Uppercase Letter | 268920 | 21.0% |
| Space Separator | 68920 | 5.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 199901 | |
| r | 138767 | |
| d | 138336 | |
| s | 130485 | |
| h | 130485 | |
| i | 69019 | 7.3% |
| e | 68994 | 7.3% |
| t | 68945 | 7.3% |
| c | 496 | 0.1% |
| l | 99 | < 0.1% |
| Other values (3) | 173 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 268325 | |
| P | 496 | 0.2% |
| M | 74 | < 0.1% |
| S | 25 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 68920 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1214620 | |
| Common | 68920 | 5.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 268325 | |
| a | 199901 | |
| r | 138767 | |
| d | 138336 | |
| s | 130485 | |
| h | 130485 | |
| i | 69019 | 5.7% |
| e | 68994 | 5.7% |
| t | 68945 | 5.7% |
| P | 496 | < 0.1% |
| Other values (7) | 867 | 0.1% |
Common
| Value | Count | Frequency (%) |
| 68920 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1283540 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 268325 | |
| a | 199901 | |
| r | 138767 | |
| d | 138336 | |
| s | 130485 | |
| h | 130485 | |
| i | 69019 | 5.4% |
| e | 68994 | 5.4% |
| t | 68945 | 5.4% |
| 68920 | 5.4% | |
| Other values (8) | 1363 | 0.1% |
company
Categorical
MISSING 
| Distinct | 28 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 93668 |
| Missing (%) | 46.8% |
| Memory size | 1.5 MiB |
| Chicago Carriage Cab Corp | |
|---|---|
| 303 Taxi | |
| City Service | |
| Medallion Leasin | |
| Taxi Affiliation Service Yellow | |
| Other values (23) |
Length
| Max length | 36 |
|---|---|
| Median length | 31 |
| Mean length | 16.397105 |
| Min length | 8 |
Characters and Unicode
| Total characters | 1743537 |
|---|---|
| Distinct characters | 45 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Metro Group |
|---|---|
| 2nd row | 303 Taxi |
| 3rd row | 303 Taxi |
| 4th row | 303 Taxi |
| 5th row | 303 Taxi |
Common Values
| Value | Count | Frequency (%) |
| Chicago Carriage Cab Corp | 18457 | 9.2% |
| 303 Taxi | 17705 | 8.9% |
| City Service | 9989 | 5.0% |
| Medallion Leasin | 9306 | 4.7% |
| Taxi Affiliation Service Yellow | 8937 | 4.5% |
| Sun Taxi | 8625 | 4.3% |
| Globe Taxi | 6387 | 3.2% |
| Metro Group | 5986 | 3.0% |
| Yellow Cab | 3148 | 1.6% |
| Nova Taxi Affiliation Llc | 3003 | 1.5% |
| Other values (18) | 14789 | 7.4% |
| (Missing) | 93668 |
Length
| Value | Count | Frequency (%) |
| taxi | 55781 | |
| cab | 24777 | 8.5% |
| chicago | 20632 | 7.1% |
| service | 19388 | 6.7% |
| carriage | 18457 | 6.4% |
| corp | 18457 | 6.4% |
| 303 | 17705 | 6.1% |
| affiliation | 13872 | 4.8% |
| yellow | 12085 | 4.2% |
| city | 9989 | 3.4% |
| Other values (34) | 79471 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 193073 | 11.1% |
| a | 190409 | 10.9% |
| 184282 | 10.6% | |
| e | 121771 | 7.0% |
| o | 109079 | 6.3% |
| r | 96355 | 5.5% |
| C | 95243 | 5.5% |
| l | 68656 | 3.9% |
| T | 56715 | 3.3% |
| x | 56715 | 3.3% |
| Other values (35) | 571239 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1231353 | |
| Uppercase Letter | 271025 | 15.5% |
| Space Separator | 184282 | 10.6% |
| Decimal Number | 56877 | 3.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 193073 | |
| a | 190409 | |
| e | 121771 | |
| o | 109079 | |
| r | 96355 | 7.8% |
| l | 68656 | 5.6% |
| x | 56715 | 4.6% |
| c | 52476 | 4.3% |
| n | 48770 | 4.0% |
| t | 41784 | 3.4% |
| Other values (13) | 252265 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 95243 | |
| T | 56715 | |
| S | 30208 | 11.1% |
| A | 17595 | 6.5% |
| M | 15579 | 5.7% |
| G | 13136 | 4.8% |
| L | 12485 | 4.6% |
| Y | 12085 | 4.5% |
| P | 5606 | 2.1% |
| N | 4942 | 1.8% |
| Other values (6) | 7431 | 2.7% |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 35410 | |
| 0 | 17705 | |
| 2 | 1878 | 3.3% |
| 4 | 1878 | 3.3% |
| 5 | 6 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 184282 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1502378 | |
| Common | 241159 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 193073 | |
| a | 190409 | |
| e | 121771 | 8.1% |
| o | 109079 | 7.3% |
| r | 96355 | 6.4% |
| C | 95243 | 6.3% |
| l | 68656 | 4.6% |
| T | 56715 | 3.8% |
| x | 56715 | 3.8% |
| c | 52476 | 3.5% |
| Other values (29) | 461886 |
Common
| Value | Count | Frequency (%) |
| 184282 | ||
| 3 | 35410 | 14.7% |
| 0 | 17705 | 7.3% |
| 2 | 1878 | 0.8% |
| 4 | 1878 | 0.8% |
| 5 | 6 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1743537 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 193073 | 11.1% |
| a | 190409 | 10.9% |
| 184282 | 10.6% | |
| e | 121771 | 7.0% |
| o | 109079 | 6.3% |
| r | 96355 | 5.5% |
| C | 95243 | 5.5% |
| l | 68656 | 3.9% |
| T | 56715 | 3.3% |
| x | 56715 | 3.3% |
| Other values (35) | 571239 |
pickup_latitude
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 313 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 29002 |
| Missing (%) | 14.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.910256 |
| Minimum | 41.660136 |
|---|---|
| Maximum | 42.016046 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 41.660136 |
|---|---|
| 5-th percentile | 41.857184 |
| Q1 | 41.880994 |
| median | 41.899156 |
| Q3 | 41.944227 |
| 95-th percentile | 41.980264 |
| Maximum | 42.016046 |
| Range | 0.35591044 |
| Interquartile range (IQR) | 0.06323213 |
Descriptive statistics
| Standard deviation | 0.04677588 |
|---|---|
| Coefficient of variation (CV) | 0.0011160963 |
| Kurtosis | 0.32812152 |
| Mean | 41.910256 |
| Median Absolute Deviation (MAD) | 0.02029003 |
| Skewness | -0.097765595 |
| Sum | 7166569.9 |
| Variance | 0.0021879829 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 41.98026432 | 16272 | 8.1% |
| 41.89960211 | 14446 | 7.2% |
| 41.88498719 | 10377 | 5.2% |
| 41.97907082 | 8874 | 4.4% |
| 41.9442266 | 8688 | 4.3% |
| 41.88099447 | 8163 | 4.1% |
| 41.89250778 | 7072 | 3.5% |
| 41.87886558 | 6830 | 3.4% |
| 41.92268628 | 5112 | 2.6% |
| 41.96581197 | 4942 | 2.5% |
| Other values (303) | 80222 | |
| (Missing) | 29002 | 14.5% |
| Value | Count | Frequency (%) |
| 41.66013605 | 1 | < 0.1% |
| 41.66367065 | 3 | < 0.1% |
| 41.6738199 | 3 | < 0.1% |
| 41.68972991 | 14 | |
| 41.69063335 | 9 | |
| 41.69487897 | 3 | < 0.1% |
| 41.70612575 | 4 | < 0.1% |
| 41.70658788 | 11 | |
| 41.70731145 | 4 | < 0.1% |
| 41.71314861 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 42.01604649 | 1 | < 0.1% |
| 42.01571991 | 1 | < 0.1% |
| 42.01569675 | 35 | < 0.1% |
| 42.00962288 | 1033 | |
| 42.00941255 | 1 | < 0.1% |
| 42.00761259 | 18 | < 0.1% |
| 42.00627886 | 1 | < 0.1% |
| 42.00555976 | 5 | < 0.1% |
| 42.00476456 | 3 | < 0.1% |
| 42.00451749 | 1 | < 0.1% |
pickup_longitude
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 313 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 29002 |
| Missing (%) | 14.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -87.684178 |
| Minimum | -87.913625 |
|---|---|
| Maximum | -87.534903 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 170998 |
| Negative (%) | 85.5% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -87.913625 |
|---|---|
| 5-th percentile | -87.913625 |
| Q1 | -87.676356 |
| median | -87.633308 |
| Q3 | -87.625192 |
| 95-th percentile | -87.618868 |
| Maximum | -87.534903 |
| Range | 0.3787217 |
| Interquartile range (IQR) | 0.05116385 |
Descriptive statistics
| Standard deviation | 0.099087132 |
|---|---|
| Coefficient of variation (CV) | -0.0011300458 |
| Kurtosis | 1.067884 |
| Mean | -87.684178 |
| Median Absolute Deviation (MAD) | 0.01443968 |
| Skewness | -1.6441395 |
| Sum | -14993819 |
| Variance | 0.0098182598 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -87.9136246 | 16272 | 8.1% |
| -87.63330804 | 14446 | 7.2% |
| -87.62099291 | 10377 | 5.2% |
| -87.90303966 | 8874 | 4.4% |
| -87.65599818 | 8688 | 4.3% |
| -87.63274649 | 8163 | 4.1% |
| -87.62621491 | 7072 | 3.5% |
| -87.62519214 | 6830 | 3.4% |
| -87.64948873 | 5112 | 2.6% |
| -87.65587879 | 4942 | 2.5% |
| Other values (303) | 80222 | |
| (Missing) | 29002 | 14.5% |
| Value | Count | Frequency (%) |
| -87.9136246 | 16272 | |
| -87.90303966 | 8874 | |
| -87.90188584 | 5 | < 0.1% |
| -87.8773054 | 15 | < 0.1% |
| -87.84435949 | 1 | < 0.1% |
| -87.84158643 | 3 | < 0.1% |
| -87.81378103 | 18 | < 0.1% |
| -87.80602 | 86 | < 0.1% |
| -87.80453201 | 113 | 0.1% |
| -87.79803218 | 136 | 0.1% |
| Value | Count | Frequency (%) |
| -87.5349029 | 4 | < 0.1% |
| -87.54093551 | 3 | < 0.1% |
| -87.5514282 | 31 | < 0.1% |
| -87.57005827 | 9 | < 0.1% |
| -87.57271713 | 9 | < 0.1% |
| -87.57278199 | 43 | < 0.1% |
| -87.5823657 | 1 | < 0.1% |
| -87.58314372 | 195 | |
| -87.58634832 | 19 | < 0.1% |
| -87.58747926 | 10 | < 0.1% |
pickup_location
Text
MISSING 
| Distinct | 313 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 29002 |
| Missing (%) | 14.5% |
| Memory size | 1.5 MiB |
Length
| Max length | 41 |
|---|---|
| Median length | 36 |
| Mean length | 35.827413 |
| Min length | 32 |
Characters and Unicode
| Total characters | 6126416 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 36 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | POINT (-87.7215590627 41.968069) |
|---|---|
| 2nd row | POINT (-87.7215590627 41.968069) |
| 3rd row | POINT (-87.7215590627 41.968069) |
| 4th row | POINT (-87.7215590627 41.968069) |
| 5th row | POINT (-87.7215590627 41.968069) |
| Value | Count | Frequency (%) |
| point | 170998 | |
| 41.9802643146 | 16272 | 3.2% |
| 87.913624596 | 16272 | 3.2% |
| 87.6333080367 | 14446 | 2.8% |
| 41.899602111 | 14446 | 2.8% |
| 87.6209929134 | 10377 | 2.0% |
| 41.8849871918 | 10377 | 2.0% |
| 87.9030396611 | 8874 | 1.7% |
| 41.9790708201 | 8874 | 1.7% |
| 87.6559981815 | 8688 | 1.7% |
| Other values (617) | 233370 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 568150 | 9.3% |
| 1 | 523075 | 8.5% |
| 4 | 482693 | 7.9% |
| 9 | 465332 | 7.6% |
| 6 | 453056 | 7.4% |
| 7 | 426928 | 7.0% |
| . | 341996 | 5.6% |
| 341996 | 5.6% | |
| 2 | 326817 | 5.3% |
| 0 | 318617 | 5.2% |
| Other values (10) | 1877756 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4074440 | |
| Uppercase Letter | 854990 | 14.0% |
| Other Punctuation | 341996 | 5.6% |
| Space Separator | 341996 | 5.6% |
| Dash Punctuation | 170998 | 2.8% |
| Open Punctuation | 170998 | 2.8% |
| Close Punctuation | 170998 | 2.8% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 568150 | |
| 1 | 523075 | |
| 4 | 482693 | |
| 9 | 465332 | |
| 6 | 453056 | |
| 7 | 426928 | |
| 2 | 326817 | |
| 0 | 318617 | |
| 3 | 258523 | |
| 5 | 251249 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 170998 | |
| O | 170998 | |
| T | 170998 | |
| N | 170998 | |
| I | 170998 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 341996 |
Space Separator
| Value | Count | Frequency (%) |
| 341996 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 170998 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 170998 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 170998 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5271426 | |
| Latin | 854990 | 14.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 568150 | |
| 1 | 523075 | |
| 4 | 482693 | |
| 9 | 465332 | |
| 6 | 453056 | |
| 7 | 426928 | |
| . | 341996 | 6.5% |
| 341996 | 6.5% | |
| 2 | 326817 | 6.2% |
| 0 | 318617 | 6.0% |
| Other values (5) | 1022766 |
Latin
| Value | Count | Frequency (%) |
| P | 170998 | |
| O | 170998 | |
| T | 170998 | |
| N | 170998 | |
| I | 170998 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6126416 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 568150 | 9.3% |
| 1 | 523075 | 8.5% |
| 4 | 482693 | 7.9% |
| 9 | 465332 | 7.6% |
| 6 | 453056 | 7.4% |
| 7 | 426928 | 7.0% |
| . | 341996 | 5.6% |
| 341996 | 5.6% | |
| 2 | 326817 | 5.3% |
| 0 | 318617 | 5.2% |
| Other values (10) | 1877756 |
dropoff_latitude
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73327 |
| Missing (%) | 36.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.894141 |
| Minimum | 41.660136 |
|---|---|
| Maximum | 42.00915 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | 41.660136 |
|---|---|
| 5-th percentile | 41.85935 |
| Q1 | 41.85935 |
| median | 41.879255 |
| Q3 | 41.909496 |
| 95-th percentile | 41.965812 |
| Maximum | 42.00915 |
| Range | 0.34901401 |
| Interquartile range (IQR) | 0.05014595 |
Descriptive statistics
| Standard deviation | 0.04060171 |
|---|---|
| Coefficient of variation (CV) | 0.00096915008 |
| Kurtosis | 0.58554817 |
| Mean | 41.894141 |
| Median Absolute Deviation (MAD) | 0.02034703 |
| Skewness | 0.084042677 |
| Sum | 5306856.5 |
| Variance | 0.0016484989 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 41.87925508 | 31775 | |
| 41.89960211 | 27580 | 13.8% |
| 41.85934972 | 26501 | 13.3% |
| 41.96581197 | 13049 | 6.5% |
| 41.92208254 | 7071 | 3.5% |
| 41.968069 | 4889 | 2.4% |
| 41.79409025 | 4739 | 2.4% |
| 41.90949567 | 3493 | 1.7% |
| 41.94315509 | 2221 | 1.1% |
| 41.8979839 | 1594 | 0.8% |
| Other values (24) | 3761 | 1.9% |
| (Missing) | 73327 |
| Value | Count | Frequency (%) |
| 41.66013605 | 7 | < 0.1% |
| 41.70731145 | 27 | < 0.1% |
| 41.72818206 | 35 | < 0.1% |
| 41.74124273 | 155 | 0.1% |
| 41.78303418 | 7 | < 0.1% |
| 41.79409025 | 4739 | |
| 41.82016661 | 2 | < 0.1% |
| 41.82740025 | 1 | < 0.1% |
| 41.83115718 | 3 | < 0.1% |
| 41.83380026 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 42.00915006 | 6 | < 0.1% |
| 42.00476456 | 127 | 0.1% |
| 41.97153938 | 18 | < 0.1% |
| 41.97028889 | 38 | < 0.1% |
| 41.968069 | 4889 | 2.4% |
| 41.96581197 | 13049 | |
| 41.95773557 | 607 | 0.3% |
| 41.94648976 | 287 | 0.1% |
| 41.94315509 | 2221 | 1.1% |
| 41.9428593 | 378 | 0.2% |
dropoff_longitude
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73327 |
| Missing (%) | 36.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -87.637959 |
| Minimum | -87.759857 |
|---|---|
| Maximum | -87.534903 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 126673 |
| Negative (%) | 63.3% |
| Memory size | 1.5 MiB |
Quantile statistics
| Minimum | -87.759857 |
|---|---|
| 5-th percentile | -87.683718 |
| Q1 | -87.642649 |
| median | -87.634156 |
| Q3 | -87.630964 |
| 95-th percentile | -87.617358 |
| Maximum | -87.534903 |
| Range | 0.22495412 |
| Interquartile range (IQR) | 0.0116854 |
Descriptive statistics
| Standard deviation | 0.023970551 |
|---|---|
| Coefficient of variation (CV) | -0.00027351791 |
| Kurtosis | 5.0617278 |
| Mean | -87.637959 |
| Median Absolute Deviation (MAD) | 0.00849291 |
| Skewness | -1.6413831 |
| Sum | -11101363 |
| Variance | 0.00057458732 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -87.642649 | 31775 | |
| -87.63330804 | 27580 | 13.8% |
| -87.61735801 | 26501 | 13.3% |
| -87.65587879 | 13049 | 6.5% |
| -87.63415609 | 7071 | 3.5% |
| -87.72155906 | 4889 | 2.4% |
| -87.59231086 | 4739 | 2.4% |
| -87.6309636 | 3493 | 1.7% |
| -87.64069808 | 2221 | 1.1% |
| -87.64149153 | 1594 | 0.8% |
| Other values (24) | 3761 | 1.9% |
| (Missing) | 73327 |
| Value | Count | Frequency (%) |
| -87.75985702 | 38 | < 0.1% |
| -87.7560677 | 1 | < 0.1% |
| -87.73893721 | 18 | < 0.1% |
| -87.72155906 | 4889 | |
| -87.71750386 | 378 | 0.2% |
| -87.7172201 | 762 | 0.4% |
| -87.6945983 | 43 | < 0.1% |
| -87.6937939 | 7 | < 0.1% |
| -87.68971128 | 74 | < 0.1% |
| -87.6837181 | 607 | 0.3% |
| Value | Count | Frequency (%) |
| -87.5349029 | 27 | < 0.1% |
| -87.5514282 | 155 | 0.1% |
| -87.5826303 | 7 | < 0.1% |
| -87.59231086 | 4739 | 2.4% |
| -87.5964756 | 35 | < 0.1% |
| -87.60284764 | 7 | < 0.1% |
| -87.61735801 | 26501 | |
| -87.62149921 | 2 | < 0.1% |
| -87.62408895 | 1 | < 0.1% |
| -87.6309636 | 3493 | 1.7% |
dropoff_location
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 73327 |
| Missing (%) | 36.7% |
| Memory size | 1.5 MiB |
| POINT (-87.642648998 41.8792550844) | |
|---|---|
| POINT (-87.6333080367 41.899602111) | |
| POINT (-87.6173580061 41.859349715) | |
| POINT (-87.6558787862 41.96581197) | |
| POINT (-87.6341560931 41.922082541) | |
| Other values (29) |
Length
| Max length | 35 |
|---|---|
| Median length | 35 |
| Mean length | 34.704436 |
| Min length | 32 |
Characters and Unicode
| Total characters | 4396115 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | POINT (-87.7215590627 41.968069) |
|---|---|
| 2nd row | POINT (-87.7215590627 41.968069) |
| 3rd row | POINT (-87.7215590627 41.968069) |
| 4th row | POINT (-87.7215590627 41.968069) |
| 5th row | POINT (-87.7215590627 41.968069) |
Common Values
| Value | Count | Frequency (%) |
| POINT (-87.642648998 41.8792550844) | 31775 | |
| POINT (-87.6333080367 41.899602111) | 27580 | 13.8% |
| POINT (-87.6173580061 41.859349715) | 26501 | 13.3% |
| POINT (-87.6558787862 41.96581197) | 13049 | 6.5% |
| POINT (-87.6341560931 41.922082541) | 7071 | 3.5% |
| POINT (-87.7215590627 41.968069) | 4889 | 2.4% |
| POINT (-87.592310855 41.794090253) | 4739 | 2.4% |
| POINT (-87.630963601 41.9094956686) | 3493 | 1.7% |
| POINT (-87.640698076 41.9431550855) | 2221 | 1.1% |
| POINT (-87.6414915334 41.897983898) | 1594 | 0.8% |
| Other values (24) | 3761 | 1.9% |
| (Missing) | 73327 |
Length
| Value | Count | Frequency (%) |
| point | 126673 | |
| 87.642648998 | 31775 | 8.4% |
| 41.8792550844 | 31775 | 8.4% |
| 87.6333080367 | 27580 | 7.3% |
| 41.899602111 | 27580 | 7.3% |
| 87.6173580061 | 26501 | 7.0% |
| 41.859349715 | 26501 | 7.0% |
| 87.6558787862 | 13049 | 3.4% |
| 41.96581197 | 13049 | 3.4% |
| 87.6341560931 | 7071 | 1.9% |
| Other values (59) | 48465 | 12.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 452151 | 10.3% |
| 1 | 364030 | 8.3% |
| 4 | 318135 | 7.2% |
| 6 | 304539 | 6.9% |
| 9 | 303711 | 6.9% |
| 7 | 302523 | 6.9% |
| . | 253346 | 5.8% |
| 253346 | 5.8% | |
| 5 | 245221 | 5.6% |
| 0 | 228209 | 5.2% |
| Other values (10) | 1370904 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2876039 | |
| Uppercase Letter | 633365 | 14.4% |
| Other Punctuation | 253346 | 5.8% |
| Space Separator | 253346 | 5.8% |
| Dash Punctuation | 126673 | 2.9% |
| Open Punctuation | 126673 | 2.9% |
| Close Punctuation | 126673 | 2.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 452151 | |
| 1 | 364030 | |
| 4 | 318135 | |
| 6 | 304539 | |
| 9 | 303711 | |
| 7 | 302523 | |
| 5 | 245221 | |
| 0 | 228209 | |
| 3 | 206776 | |
| 2 | 150744 | 5.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 126673 | |
| T | 126673 | |
| N | 126673 | |
| I | 126673 | |
| P | 126673 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 253346 |
Space Separator
| Value | Count | Frequency (%) |
| 253346 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 126673 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 126673 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 126673 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3762750 | |
| Latin | 633365 | 14.4% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 452151 | |
| 1 | 364030 | |
| 4 | 318135 | |
| 6 | 304539 | |
| 9 | 303711 | |
| 7 | 302523 | |
| . | 253346 | 6.7% |
| 253346 | 6.7% | |
| 5 | 245221 | 6.5% |
| 0 | 228209 | 6.1% |
| Other values (5) | 737539 |
Latin
| Value | Count | Frequency (%) |
| O | 126673 | |
| T | 126673 | |
| N | 126673 | |
| I | 126673 | |
| P | 126673 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4396115 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 452151 | 10.3% |
| 1 | 364030 | 8.3% |
| 4 | 318135 | 7.2% |
| 6 | 304539 | 6.9% |
| 9 | 303711 | 6.9% |
| 7 | 302523 | 6.9% |
| . | 253346 | 5.8% |
| 253346 | 5.8% | |
| 5 | 245221 | 5.6% |
| 0 | 228209 | 5.2% |
| Other values (10) | 1370904 |
| company | dropoff_census_tract | dropoff_community_area | dropoff_latitude | dropoff_location | dropoff_longitude | extras | fare | payment_type | pickup_census_tract | pickup_community_area | pickup_latitude | pickup_longitude | tips | tolls | trip_miles | trip_seconds | trip_total | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| company | 1.000 | -0.036 | -0.037 | 0.039 | 0.090 | -0.009 | 0.145 | 0.203 | 0.135 | -0.048 | -0.059 | 0.046 | -0.008 | 0.090 | 0.004 | 0.026 | 0.072 | 0.205 |
| dropoff_census_tract | -0.036 | 1.000 | 0.979 | -0.968 | 1.000 | 0.615 | 0.144 | 0.153 | 0.028 | 0.242 | 0.242 | -0.197 | 0.051 | 0.009 | 0.017 | 0.134 | 0.134 | 0.158 |
| dropoff_community_area | -0.037 | 0.979 | 1.000 | -0.940 | 1.000 | 0.565 | 0.143 | 0.037 | 0.048 | 0.200 | 0.347 | -0.482 | 0.430 | 0.035 | -0.001 | -0.001 | 0.091 | 0.060 |
| dropoff_latitude | 0.039 | -0.968 | -0.940 | 1.000 | 1.000 | -0.658 | -0.136 | -0.013 | 0.040 | -0.199 | -0.338 | 0.506 | -0.456 | -0.046 | 0.004 | 0.024 | -0.066 | -0.040 |
| dropoff_location | 0.090 | 1.000 | 1.000 | 1.000 | 1.000 | -1.000 | -0.115 | -0.156 | 0.061 | -0.148 | -0.192 | 0.246 | -0.135 | -0.066 | 0.004 | -0.157 | -0.126 | -0.163 |
| dropoff_longitude | -0.009 | 0.615 | 0.565 | -0.658 | -1.000 | 1.000 | 0.115 | 0.156 | 0.041 | 0.148 | 0.192 | -0.246 | 0.135 | 0.066 | -0.004 | 0.157 | 0.126 | 0.163 |
| extras | 0.145 | 0.144 | 0.143 | -0.136 | -0.115 | 0.115 | 1.000 | 0.479 | 0.104 | 0.259 | 0.388 | 0.200 | -0.335 | 0.219 | 0.030 | 0.425 | 0.411 | 0.559 |
| fare | 0.203 | 0.153 | 0.037 | -0.013 | -0.156 | 0.156 | 0.479 | 1.000 | 0.000 | 0.129 | 0.270 | 0.232 | -0.393 | 0.275 | 0.027 | 0.865 | 0.858 | 0.971 |
| payment_type | 0.135 | 0.028 | 0.048 | 0.040 | 0.061 | 0.041 | 0.104 | 0.000 | 1.000 | 0.101 | 0.132 | 0.023 | -0.059 | 0.893 | 0.008 | 0.196 | 0.182 | 0.361 |
| pickup_census_tract | -0.048 | 0.242 | 0.200 | -0.199 | -0.148 | 0.148 | 0.259 | 0.129 | 0.101 | 1.000 | 0.901 | -0.354 | -0.384 | 0.138 | 0.020 | 0.140 | 0.145 | 0.156 |
| pickup_community_area | -0.059 | 0.242 | 0.347 | -0.338 | -0.192 | 0.192 | 0.388 | 0.270 | 0.132 | 0.901 | 1.000 | -0.118 | -0.219 | 0.170 | 0.018 | 0.285 | 0.265 | 0.304 |
| pickup_latitude | 0.046 | -0.197 | -0.482 | 0.506 | 0.246 | -0.246 | 0.200 | 0.232 | 0.023 | -0.354 | -0.118 | 1.000 | -0.656 | 0.058 | 0.014 | 0.270 | 0.199 | 0.228 |
| pickup_longitude | -0.008 | 0.051 | 0.430 | -0.456 | -0.135 | 0.135 | -0.335 | -0.393 | -0.059 | -0.384 | -0.219 | -0.656 | 1.000 | -0.114 | -0.016 | -0.436 | -0.350 | -0.393 |
| tips | 0.090 | 0.009 | 0.035 | -0.046 | -0.066 | 0.066 | 0.219 | 0.275 | 0.893 | 0.138 | 0.170 | 0.058 | -0.114 | 1.000 | 0.012 | 0.254 | 0.240 | 0.415 |
| tolls | 0.004 | 0.017 | -0.001 | 0.004 | 0.004 | -0.004 | 0.030 | 0.027 | 0.008 | 0.020 | 0.018 | 0.014 | -0.016 | 0.012 | 1.000 | 0.020 | 0.021 | 0.032 |
| trip_miles | 0.026 | 0.134 | -0.001 | 0.024 | -0.157 | 0.157 | 0.425 | 0.865 | 0.196 | 0.140 | 0.285 | 0.270 | -0.436 | 0.254 | 0.020 | 1.000 | 0.842 | 0.842 |
| trip_seconds | 0.072 | 0.134 | 0.091 | -0.066 | -0.126 | 0.126 | 0.411 | 0.858 | 0.182 | 0.145 | 0.265 | 0.199 | -0.350 | 0.240 | 0.021 | 0.842 | 1.000 | 0.832 |
| trip_total | 0.205 | 0.158 | 0.060 | -0.040 | -0.163 | 0.163 | 0.559 | 0.971 | 0.361 | 0.156 | 0.304 | 0.228 | -0.393 | 0.415 | 0.032 | 0.842 | 0.832 | 1.000 |
| unique_key | trip_start_timestamp | trip_end_timestamp | trip_seconds | trip_miles | pickup_census_tract | dropoff_census_tract | pickup_community_area | dropoff_community_area | fare | tips | tolls | extras | trip_total | payment_type | company | pickup_latitude | pickup_longitude | pickup_location | dropoff_latitude | dropoff_longitude | dropoff_location | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | e2afbb4f62fb3865c4d928a39e8a4d1e711ea8da | 2017-11-03 11:45:00 UTC | 2017-11-03 11:45:00 UTC | 365.0 | 2.3 | NaN | NaN | NaN | NaN | 7.20 | 0.0 | NaN | 0.0 | 7.20 | Cash | Metro Group | NaN | NaN | NaN | NaN | NaN | NaN |
| 1 | a663e8249660b7a3ae13c9a39378ff495564e8e6 | 2016-10-28 20:30:00 UTC | 2016-10-28 20:30:00 UTC | 192.0 | 1.4 | NaN | NaN | NaN | NaN | 5.60 | 0.0 | NaN | 0.0 | 5.60 | Cash | 303 Taxi | NaN | NaN | NaN | NaN | NaN | NaN |
| 2 | a99a7309aea3cf70eed1644c254eb30b150ac4f2 | 2016-10-28 20:45:00 UTC | 2016-10-28 20:45:00 UTC | 328.0 | 2.1 | NaN | NaN | NaN | NaN | 7.60 | 0.0 | NaN | 0.0 | 7.60 | Cash | 303 Taxi | NaN | NaN | NaN | NaN | NaN | NaN |
| 3 | ad0d9e702d67b0e9e7b85dd0750605ff06389c4f | 2016-10-28 21:45:00 UTC | 2016-10-28 22:00:00 UTC | 706.0 | 7.6 | NaN | NaN | NaN | NaN | 20.40 | 0.0 | NaN | 2.0 | 22.40 | Cash | 303 Taxi | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 350b48036f17d07f79abf53d04b811da8b6c264c | 2016-10-29 09:45:00 UTC | 2016-10-29 10:30:00 UTC | 3407.0 | 50.7 | NaN | NaN | NaN | NaN | 0.01 | 0.0 | NaN | 0.0 | 0.01 | Cash | 303 Taxi | NaN | NaN | NaN | NaN | NaN | NaN |
| 5 | b5d0ec1472abae045f794a581f42364213ef79ab | 2016-10-29 11:30:00 UTC | 2016-10-29 11:30:00 UTC | 123.0 | 0.2 | NaN | NaN | NaN | NaN | 3.00 | 0.0 | NaN | 0.0 | 3.00 | Cash | 303 Taxi | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | ac7ce885bf43a27e0e22de0c1e8efd98ebdd1571 | 2016-10-29 12:00:00 UTC | 2016-10-29 12:15:00 UTC | 419.0 | 2.4 | NaN | NaN | NaN | NaN | 7.40 | 0.0 | NaN | 1.0 | 8.40 | Cash | 303 Taxi | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 68088a5a378c45781ece9cdb1eebdc2afc4047d3 | 2017-11-03 12:30:00 UTC | 2017-11-03 13:15:00 UTC | 2069.0 | 3.8 | NaN | NaN | NaN | NaN | 20.20 | 0.0 | NaN | 0.0 | 20.20 | Cash | Metro Group | NaN | NaN | NaN | NaN | NaN | NaN |
| 8 | 4f4d0f4eec17eae0c01aa5a62facf62ccbb48e57 | 2017-11-03 13:15:00 UTC | 2017-11-03 13:30:00 UTC | 460.0 | 2.8 | NaN | NaN | NaN | NaN | 8.20 | 0.0 | NaN | 0.0 | 8.20 | Cash | Metro Group | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | b695916c866d9137a22ad63bb26668d7bdb462c2 | 2017-11-03 15:15:00 UTC | 2017-11-03 15:15:00 UTC | 3.0 | 0.0 | NaN | NaN | NaN | NaN | 2.00 | 0.0 | NaN | 0.0 | 2.00 | Cash | Metro Group | NaN | NaN | NaN | NaN | NaN | NaN |
| unique_key | trip_start_timestamp | trip_end_timestamp | trip_seconds | trip_miles | pickup_census_tract | dropoff_census_tract | pickup_community_area | dropoff_community_area | fare | tips | tolls | extras | trip_total | payment_type | company | pickup_latitude | pickup_longitude | pickup_location | dropoff_latitude | dropoff_longitude | dropoff_location | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 199990 | 87d5788c78dfb92ee8ed9c31908cb2124b064f1d | 2014-04-26 18:30:00 UTC | 2014-04-26 18:45:00 UTC | 1560.0 | 10.8 | NaN | NaN | 8.0 | 41.0 | 24.65 | 0.00 | 0.0 | 0.0 | 24.65 | Cash | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199991 | 67e3eb3d78a65c2b913ecd473fd05c677dfd4738 | 2014-05-12 20:45:00 UTC | 2014-05-12 21:15:00 UTC | 1320.0 | 16.5 | NaN | NaN | 8.0 | 41.0 | 35.25 | 0.00 | 0.0 | 0.0 | 35.25 | Cash | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199992 | 8b7001ad7784574ebe32b9f3f0d8d8ccd712c0b3 | 2014-05-11 01:45:00 UTC | 2014-05-11 02:15:00 UTC | 1200.0 | 8.9 | NaN | NaN | 8.0 | 41.0 | 21.05 | 4.41 | 0.0 | 1.0 | 26.46 | Credit Card | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199993 | 3e23d3638d307b27df4afd31bba5b31ab535a85a | 2014-04-22 21:15:00 UTC | 2014-04-22 21:30:00 UTC | 1380.0 | 8.5 | NaN | NaN | 8.0 | 41.0 | 21.25 | 0.00 | 0.0 | 2.0 | 23.25 | Cash | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199994 | 04191a501246d7a82e88bdfb217dabf7beb21819 | 2014-04-25 22:30:00 UTC | 2014-04-25 22:45:00 UTC | 1020.0 | 8.1 | NaN | NaN | 8.0 | 41.0 | 19.05 | 0.00 | 0.0 | 0.0 | 19.05 | Cash | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199995 | 7c5eee2d5cfa2efbded24be40fad831ca0e0c983 | 2014-05-07 10:15:00 UTC | 2014-05-07 10:30:00 UTC | 1020.0 | 9.8 | NaN | NaN | 8.0 | 41.0 | 22.25 | 0.00 | 0.0 | 0.0 | 22.25 | Cash | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199996 | f8d6826d7b5b1e5f33a8fe6f397d0650c4d4e515 | 2014-05-23 11:00:00 UTC | 2014-05-23 11:30:00 UTC | 1500.0 | 8.7 | NaN | NaN | 8.0 | 41.0 | 22.25 | 1.50 | 0.0 | 0.0 | 23.75 | Credit Card | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199997 | 74c5df50d09536ef33bc928e5d20a422a015e0c5 | 2014-05-03 03:45:00 UTC | 2014-05-03 04:00:00 UTC | 1200.0 | 7.9 | NaN | NaN | 8.0 | 41.0 | 18.85 | 3.77 | 0.0 | 0.0 | 22.62 | Credit Card | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199998 | ece539fddea592c9a9929013d9b37e6463156ac5 | 2014-06-07 03:30:00 UTC | 2014-06-07 03:45:00 UTC | 1080.0 | 10.1 | NaN | NaN | 8.0 | 41.0 | 22.45 | 3.00 | 0.0 | 0.0 | 25.45 | Credit Card | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |
| 199999 | bb053bfdb1dd7024c75e9cad526201dda0b75523 | 2014-05-14 20:30:00 UTC | 2014-05-14 20:45:00 UTC | 1080.0 | 8.2 | NaN | NaN | 8.0 | 41.0 | 19.45 | 0.00 | 0.0 | 0.0 | 19.45 | Cash | NaN | 41.899602 | -87.633308 | POINT (-87.6333080367 41.899602111) | 41.79409 | -87.592311 | POINT (-87.592310855 41.794090253) |